distfit | distfit is a python library for probability density fitting | Analytics library
kandi X-RAY | distfit Summary
kandi X-RAY | distfit Summary
Python package for probability density function fitting of univariate distributions of non-censored data
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Apply smoothing to the data
- Compute the fit score
- Compute confidence interval
- Compute the model for the given distribution
- Plot the distribution
- Plot empirical quantile
- Plot the parameter distribution
- Plot binom
- Predict the distribution
- Perform a multtest fitting on the distribution
- Predict percentile
- Compute the probability of the fitted distribution
- Fit the model to data
- Computes the best fitting of the given data
- Clean the model results
- Creates a dictionary
- Plot a summary of the distribution
- Save model to file
- Load a model from a file
- Generate n samples
distfit Key Features
distfit Examples and Code Snippets
Community Discussions
Trending Discussions on distfit
QUESTION
I have used distfit library to find the best distribution that will fit my data to avoid skewness. Let us say, I have transformed my data into Normal distribution using the boxcox method.
After this, shall I scale my data, for example, using Robust Scaler that handles outliers very well.
I am confused that I should be following both the steps or just one.
Not sure, if I am heading in the right direction in the data prep phase. please share your thoughts on this. Thanks!
...ANSWER
Answered 2021-Sep-23 at 05:09You might or might have to do scaling after Normalization.
Answer depends on what are we doing to this data. e.g. Are we planing to fit some model? or anything else?
One concrete example is:
If want to train our model for Neural Networks, then let see:
- For faster convergence of training: We should have mean= 0 and sigma=1 (Normalization needed)
- For effective regularization, you mush have all the data features at similar scale. (Scaling needed)
On contrast, if you want to fit say Decision Tree, then neither of these things are needed.
So, it all boils down to what we have to do after processing the data.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install distfit
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page